Rank in Wordlist | Frequency | Word |
---|---|---|
2648 | 40542 | it,” |
2904 | 36886 | 1,000 |
3937 | 26029 | 10,000 |
4656 | 21180 | 100,000 |
4870 | 20188 | 2,000 |
5035 | 19450 | 5,000 |
5085 | 19231 | that,” |
5473 | 17581 | them,” |
5875 | 15991 | me,” |
6101 | 15287 | time,” |
Rank in Wordlist | Frequency | Word |
---|---|---|
4655921 | 1 | carbohydrates(60%-70% |
Rank in Wordlist | Frequency | Word |
---|---|---|
526092 | 19 | .) |
628618 | 15 | Sunn O) |
628619 | 15 | Sunn O)) |
628620 | 15 | Sunn O))) |
1460312 | 4 | New York Knicks) |
4073702 | 1 | Photo)''2face |
Rank in Wordlist | Frequency | Word |
---|---|---|
7307 | 12133 | 100% |
8023 | 10728 | 10% |
8220 | 10368 | 50% |
8706 | 9631 | 20% |
10828 | 7182 | 30% |
10834 | 7177 | 5% |
11045 | 7006 | 25% |
11106 | 6959 | 40% |
11338 | 6753 | 1% |
11808 | 6423 | 2% |
Rank in Wordlist | Frequency | Word |
---|---|---|
4993 | 19651 | S&P |
11425 | 6691 | AT&T |
14753 | 4736 | A&M |
16225 | 4132 | R&B |
17456 | 3722 | Q&A |
18118 | 3532 | R&D |
20232 | 3024 | J&K |
20530 | 2961 | Y&R |
21300 | 2814 | B&B |
28720 | 1820 | PG&E |
Rank in Wordlist | Frequency | Word |
---|---|---|
59648 | 605 | US$1 |
76735 | 406 | A$AP |
79064 | 388 | US$5 |
79643 | 383 | US$100 |
83378 | 357 | US$2 |
90505 | 314 | US$10 |
92557 | 303 | US$50 |
98647 | 274 | US$200 |
101649 | 261 | US$3 |
111493 | 225 | US$500 |
Rank in Wordlist | Frequency | Word |
---|---|---|
176 | 459836 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
588 | 170131 | it's |
638 | 159102 | don't |
758 | 136661 | It's |
1105 | 97283 | I'm |
1480 | 73894 | didn't |
1795 | 61502 | that's |
1982 | 55572 | .' |
2002 | 54969 | doesn't |
2211 | 49181 | can't |
2218 | 49024 | you're |
Rank in Wordlist | Frequency | Word |
---|---|---|
52275 | 740 | Galaxy S9+ |
103671 | 252 | APNU+AFC |
105491 | 245 | A+E |
110999 | 226 | 2+2 |
165252 | 120 | A+E Networks |
171926 | 113 | Galaxy S8+ |
180324 | 105 | P5+1 |
185643 | 100 | HTC U12+ |
196567 | 91 | 12MP+5MP |
214810 | 79 | 10+2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
176876 | 108 | E*TRADE Financial |
391894 | 31 | Sagittarius A* |
410000 | 29 | Tree of Life*Or L’Simcha |
570993 | 17 | Grade II* |
630672 | 15 | Wal * Mart |
689088 | 13 | Sgr A* |
710048 | 12 | E*TRADE Financial Corp |
720418 | 12 | NOC * NSF |
753041 | 11 | Grade II* listed |
914497 | 8 | E*Trade Financial |
Rank in Wordlist | Frequency | Word |
---|---|---|
4502 | 22079 | https://t |
5398 | 17856 | and/or |
8196 | 10402 | https://www |
9235 | 8884 | 24/7 |
13944 | 5142 | 9/11 |
15531 | 4397 | 1/2 |
16562 | 4005 | https://en |
18213 | 3502 | P/E |
23829 | 2400 | 1/3 |
23985 | 2378 | 2/3 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing disallowed characters are not limited to very low frequencies, this may point to a problem in preprocessing.
Words containing %:
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;
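The same query can be repeated for each character in the list. The sketch below shows one way to do this. It is an illustration under stated assumptions, not the tooling used for this report: it runs against an in-memory SQLite table rather than the original database, the helper names `like_pattern`, `words_containing` and `demo_connection` are hypothetical, the `ORDER BY` clause is added only to make the output deterministic, and the toy rows are copied from the tables in this section with `w_id` set to rank + 100 to mirror `w_id-100` in the query. Note that `%` and `_` are wildcards in `LIKE` and must be escaped, which is why the query for `%` uses `\%`.

```python
import sqlite3

# Characters checked in this subsection (taken from the text above).
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch: str) -> str:
    """Return a LIKE pattern matching any word that contains ch.

    % and _ are LIKE wildcards and the backslash is the escape character,
    so these three are prefixed with a backslash.
    """
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch
    return f"%{ch}%"


def words_containing(conn, ch, limit=10):
    """Return up to `limit` (rank, freq, word) rows for words containing ch."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"  # ORDER BY is an addition for determinism
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


def demo_connection():
    """Build an in-memory toy table with a few rows copied from this section."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE words (w_id INTEGER PRIMARY KEY, freq INTEGER, word TEXT)"
    )
    rows = [  # assumption: w_id = rank + 100
        (7407, 12133, "100%"),
        (8123, 10728, "10%"),
        (11525, 6691, "AT&T"),
        (5498, 17856, "and/or"),
        (688, 170131, "it's"),
    ]
    conn.executemany("INSERT INTO words VALUES (?, ?, ?)", rows)
    return conn


if __name__ == "__main__":
    conn = demo_connection()
    for ch in SPECIAL_CHARS:
        print(ch, words_containing(conn, ch))
```

Passing the pattern as a bound parameter avoids quoting problems for the `"` and `'` characters, which would otherwise have to be escaped inside the SQL string literal.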
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots